Using Java for distributed computing in the Gaia satellite data processing

Authors

  • William O'Mullane
  • Xavier Luri
  • Paul Parsons
  • Uwe Lammers
  • John Hoar
  • Jose Hernandez
Abstract

In recent years Java has matured into a stable, easy-to-use language with the flexibility of an interpreter (for reflection etc.) but the performance and type checking of a compiled language. When we started using Java for astronomical applications around 1999, they were the first of their kind in astronomy. Now a great deal of astronomy software is written in Java, as are many business applications. We discuss the current environment and trends concerning the language and present an actual example of scientific use of Java for high-performance distributed computing: ESA’s Gaia mission. The Gaia scanning satellite will perform a galactic census of about 1000 million objects in our galaxy. The Gaia community has chosen to write its processing software in Java. We explore the manifold reasons for choosing Java for this large science collaboration. Gaia processing is numerically complex but highly distributable, some parts being embarrassingly parallel. We describe the Gaia processing architecture and its realisation in Java. We delve into the astrometric solution, which is the most advanced and most complex part of the processing. The Gaia simulator is also written in Java and is the most mature code in the system. It has been running successfully since about 2005 on the supercomputer "Marenostrum" in Barcelona. We relate experiences of using Java on a large shared machine. Finally we discuss Java, including some of its problems, for scientific computing.

PACS: 07.05.Kf · 95.10.Jk · 07.87.+v

Similar resources

A Solution for Satellite Image Processing on Grids

Remote sensing image processing is both data and computing intensive. Grid technologies currently provide powerful tools for remote sensing data sharing and processing. After an overview of the recent initiatives of gridifying satellite image processing, two specific usage scenarios are analyzed. The solution that is proposed is based on freely distributed and general-purpose software: latest ...

Eric : A User and Applications Interface to a Distributed Satellite Data Repository

We describe a distributed computational infrastructure for accessing and processing a large repository of geostationary satellite data through a World Wide Web browser interface. Our repository of GMS5 satellite data is stored on a combined RAID and tape silo system, accessible from a cluster of DEC Alpha workstations, interconnected by ATM LAN technology locally (in Adelaide) and also availabl...

A Web-based Interface for On-Demand Processing of Satellite Imagery Archives

We describe a web-based control system for invoking pipelined processes on a large on-line archive of geostationary satellite imagery through a World Wide Web browser interface. Our archive of GMS5 satellite data is stored on a combined RAID and tape silo system, accessible from a cluster of ATM-connected DEC Alpha workstations in Adelaide and Canberra. Our system makes use of parallel and dist...

Entropy-based Consensus for Distributed Data Clustering

The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...

Development of a Model for Predicting Heart Attack Based on Fog Computing

Introduction: Various studies have demonstrated the benefits of using distributed fog computing for the Internet of Things (IoT). Fog computing has brought cloud computing capabilities such as computing, storage, and processing closer to IoT nodes. The new model of fog and edge computing, compared to cloud computing, provides less latency for data processing by bringing resources closer to user...

Journal:
  • CoRR

Volume: abs/1108.0355

Publication date: 2011